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Abstract 

We consider the communication scenario where a source-destination pair wishes to keep the in- 
formation secret from a relay node despite wanting to enlist its help. For this scenario, an interesting 
question is whether the relay node should be deployed at all. That is, whether cooperation with an 
untrusted relay node can ever be beneficial. We first provide an achievable secrecy rate for the general 
untrusted relay channel, and proceed to investigate this question for two types of relay networks with 
orthogonal components. For the first model, there is an orthogonal link from the source to the relay. 
For the second model, there is an orthogonal link from the relay to the destination. For the first model, 
we find the equivocation capacity region and show that answer is negative. In contrast, for the second 
model, we find that the answer is positive. Specifically, we show by means of the achievable secrecy 
rate based on compress-and-forward, that, by asking the untrusted relay node to relay information, we 
can achieve a higher secrecy rate than just treating the relay as an eavesdropper For a special class of 
the second model, where the relay is not interfering itself, we derive an upper bound for the secrecy 
rate using an argument whose net effect is to separate the eavesdropper from the relay. The merit of 
the new upper bound is demonstrated on two channels that belong to this special class. The Gaussian 
case of the second model mentioned above benefits from this approach in that the new upper bound 
improves the previously known bounds. For the Cover-Kim deterministic relay channel, the new upper 
bound finds the secrecy capacity when the source-destination link is not worse than the source-relay 
link, by matching with achievable rate we present. 

This work was presented in part at the 41st Asilomar Conference on Signals, System and Computers, November 2007, 
Information Theory and Application Workshop, ITA 2008, lanuary, 2008, and International Symposium on Information Theory, 
ISIT 2008, luly, 2008. This work is supported in part by the National Science Foundation with Grants CCR-0237727, CCF- 
051483, CNS-0716325, and the DARPA ITMANET Program with Grant W91 lNF-07- 1-0028. 
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I. Introduction 

A fundamental approach to information security is founded in information theory where the 
limits of reliable communication can be determined while keeping the information secret from 
eavesdropping node(s). This notion of secrecy was first proposed by Shannon [1]. In his work, 
Shannon assumed that the eavesdropper has perfect access to the signal transmitted from the 
source to the destination and determined that the rate of key must equal to the rate of data to 
ensure "perfect secrecy", i.e., in order for the data not to be leaked to the eavesdropper even if 
the eavesdropper has unlimited computational power. Wyner, in [2], pointed out that Shannon's 
assumption is pessimistic, as more often than not, the eavesdropper only has a noisy copy of 
the signal transmitted from the source and building a useful secure communication system per 
Shannon's notion is possible [2], [3]. 

Recent work in this area aims to find the secrecy capacity or capacity region for a variety of 
communication scenarios and channel models. A set of models follows the classical model of 
Wyner's wiretap channel [2], where an external eavesdropper is present in addition to legitimate 
parties. This line of work includes the multiple input multiple output (MIMO) wiretap channel 
[4]-[6], the wiretap channel with a cooperative jammer [7], the multiple access wiretap channel 
[8]-[ll], the MIMO broadcast channel with multiple legitimate receivers and an external eaves- 
dropper [12], the two-way wiretap channel [10], the relay channel with an external eavesdropper 
[13]. In these models, where the information leaked to the eavesdropper is a loss to the legitimate 
communication system, it was observed that legitimate parties could aid in enhancing secrecy 
by introducing intentional interference to the eavesdropper via cooperative jamming [7], [10], 
[14]. Another set of models deals with a more symmetric scenario, where each receiver of an 
intended message is also modeled as an eavesdropper for the remaining unintended messages in 
the system. This setting has been considered for the multiple access [15], broadcast [16], [17], 
and interference channels [18], [19]. In these models, one communication pair, in the interest of 
protecting its own information, may end up helping the other pair [19]. 

The focus of this work, while on cooperative communications, differs from the above models 
in that, it deals with a communication network whose nodes have different levels of security 
clearance. Examples like this exist in real life. In a government intelligence network or the 
network of a financial institution, not every node in the network is supposed to have the same 
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level of access to information, despite operating with agreed protocols and serving as relay 
nodes in the network. The question is whether these untrusted nodes should still participate in 
this cooperative communication network, or if they pose a "problem" when secret messages are 
to be transmitted, and hence, their cooperation should not be enlisted. The basic issue addressed 
here therefore, different from the previous models that aim to solve the co-existence problem of 
several communication pairs, is to resolve the conflict within one system. 

This paper focuses on the most basic model in this category in order to assess the effect 
of secrecy requirements upon cooperative communications. We consider the three node relay 
network, where the relay has a lower security clearance than the destination and is therefore 
untrusted. Reference [20], the first work that studied this model, shows that the secrecy capacity 
of this system is zero if the relay channel is degraded. The secrecy capacity equals that of the 
wiretap channel if the channel is reversely degraded, which means that the relay-to-destination 
communication is useless in this case as well [21]. In Section |IV] of this paper, we present 
yet another negative result: the relay node is again useless in a class of relay networks with 
orthogonal components [22], dubbed Model 1 in the sequel. In each of these references, the 
model turns out to be equivalent to the models of the first two categories, where the relay node 
is merely an eavesdropper, rather than a cooperating partner. 

In light of these results, one might be tempted to take a pessimistic view, and wonder whether 
there exists any situation where the cooperation of the untrusted relay might enable a higher 
secrecy rate than simply treating it as an eavesdropper. Interestingly, we find in this paper that 
the answer to this question is yes. This is shown for a class of relay networks with orthogonal 
components where the relay to destination link is orthogonal to that from the source [23], dubbed 
Model 2 in the sequel. Specifically, we observe that by performing compress-and-forward, the 
relay node can help increase an otherwise zero secrecy rate without having any idea what it is 
relaying. 

Once an achievable secrecy rate with the untrusted relay being useful is found, an upper bound 
on the secrecy rate is needed to assess how close the achievable strategy is to the optimum. There 
are two previously known upper bounds. Reference [13] provided an upper bound for the relay 
channel with an external eavesdropper. By assuming that this external eavesdropper receives the 
signals received and transmitted by the relay, we observe the model in [13] can be specialized 
into the model considered in this work and hence the bound in [13] can be readily applied. Alas, 
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this bound is not computable for the Gaussian case. A computable bound was provided for the 
Gaussian relay channel with a co-located eavesdropper in [21]. Alas, this bound does not depend 
on the condition of the relay-to-destination channel. Moreover, the noise correlation of the links 
may render the bound to be arbitrarily loose. In this work, we aim to derive an upper bound 
that improves the bound in [21] in these two aspects, and accomplish this goal for a class of 
untrusted relay channels. 

More specifically, the upper bound on the secrecy rate is derived for a special class of Model 2, 
where the relay is not interfering itself. The derivation of the upper bound entails the introduction 
of a second eavesdropper. Although in general, introducing a second eavesdropper can decrease 
the secrecy capacity, we prove that for the special class of channels question, doing so does not 
alter the secrecy capacity. The upper bound is then derived by removing the first eavesdropper 
at the relay and introducing correlation between the output seen by the second eavesdropper and 
other outputs of the channel, which tightens the upper bound as in other Sato-type bounds; see 
[5] for example. 

The merit of the new upper bound is demonstrated in two cases: First, for the Gaussian case 
of Model 2, we show that the new bound improves the previously known bounds. Second, for 
the Gaussian Cover- Kim deterministic relay channel introduced in [24], we show that the upper 
bound matches the achievable rate using compress-and-forward when the signal to noise ratio 
of the source-destination link is not worse than that of the source-relay link, thus, establishing 
the secrecy capacity. 

The remainder of the paper is organized as follows: Section |ll] describes the general relay 
network with a co-located eavesdropper, and an achievable equivocation region for this channel 
using the compress-and-forward relaying. In section [nil the two special cases of the general 
model, i.e.. Model 1 and Model 2 are described. Section |IV] presents the equivocation capacity 
region for Model 1. Section |V] specializes the achievable region found in Section HI] to Model 2. 
Section |VI] identifies a special class of Model 2, for which introducing a second eavesdropper 
properly will not decrease the secrecy capacity, and derives an upper bound for its secrecy rate. 
The upper bound is then specialized to the Gaussian case of Model 2. Section IVTll investigates the 
secrecy capacity of Gaussian Cover-Kim deterministic relay channel. We note that to facilitate a 
better flow throughout the manuscript, more involved proofs are presented in appendices whereas 
shorter ones are kept in the main text. 
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Fig. 1. Relay Channel with a Co-located Eavesdropper: S: Source Node, R/E: Relay Node with a Co-located Eavesdropper, 
D: Destination Node. 



Throughout this paper, the following notation is used: Vfc, Ek denotes a variable that goes to 
when n goes to oo. C(x) = | log2(l +x). X" denotes a vector of length n, whereas Xi denotes 
the ith element of the vector. X]j[ denotes the set {Xj, l<j<ioTi<j<n}.Xl denotes the 
set {Xj, I < j < i}', the set is empty if i < 1. [a\ denotes the largest integer less than or equal 
to a. The short hand W{a, ...,b) stands for the set {W{a), W{a + 1), W{b)}. The short hand 
W" stands for the set {W{1), W{2), W{a)}. 

II. Achievable Secrecy Rate for the General Relay Channel with Co-located 

Eavesdropper 

The relay channel with a co-located eavesdropper was first considered in [20] and is shown in 
Figure [B It is a memoryless three-node relay channel [25], whose description is p{Y, Yr\X, Xr). 
X, Xr are the channel inputs from the source and the relay respectively, and Y, Yj. are the 
channel outputs observed by the destination and the relay respectively. We assume that there is 
an eavesdropper at the relay node who has access to everything that the relay node knows. The 
source wishes to send message W to the destination over n channel uses, while keeping it secret 
from the eavesdropper. 

Without loss of generality, the relaying function for the zth channel use can be defined as 

Xr, = g^{xi-\Y:-\A) (1) 

where A is a random variable which models any stochastic mapping employed by the relay 
node. Hence, without loss of generality, we can restrict gi to be a deterministic function. 
The information available to the eavesdropper regarding the secret message W is {X^, Y^, A}. 
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Fig. 2. Example: Relay's transmitted signal X" can provide more information about W 

Thus, the equivocation rate is computed as 

Re = \im -H (ly i;", A) (2) 

n— >oo fx 

Note that since W-{X;^, Y^}-A is a Markov chain, we have H = H 

Hence the equivocation rate can be instead defined as 

Re = lim -H{W\Xl^,Y,^) (3) 

n— >oo j2 

Remark 1: In general //(lyjlT^) > F^"). To see an example, consider the channel 

model in Figure |2l Let © denote the binary addition. The channel has binary input. The 
destination receives signals from two orthogonal links: Yr and Yd, and that we have 

Yd = X (4) 

YR = Xr®Z (5) 

where Z represents the i.i.d. binary noise. 

It follows in this setting that, H{W\Xl^,Y^) = 0. This is because the relay can always 
subtract the interference caused by X^ on its received signal and hence obtains X". However, 
H{W\Y^) = H{W) if X" is chosen to be an i.i.d. binary sequences, each component of 
which takes the value 1 with probability i. Thus, in this case, we clearly have H{W\Y^') > 

Hiwixi^Y;"). □ 

We should note however that, H{W\Y^) = H{W\X^, Y^) if the relaying scheme is determin- 
istic: Xr,i = gi{Xl~^, ~^)- Also, note that, clearly, any outer bound derived for the equivocation 
H{W\Y;') is an outer bound for H{W\X^,Y;'). 
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With this preparation, the equivocation rate region can be defined as follows: Let the message 
decoded by the destination be W. The equivocation rate region is composed of all rate pairs 
(/?!, Re) such that: 

Ri = lim — logo \W\ 

n->oo fi 



Re = lim -h{w\x;^,y;') 

s.t. lim - Pi (W y^w) = 



n 

Here \W\ is the cardinality of the message set W. Note that when block Markov coding 
scheme [25] is used, the message is transmitted via successive blocks. In this case, W denotes 
the messages transmitted over all blocks, n should be the total number of channel uses of these 
blocks. The definition of X", should be adjusted accordingly. 

Next, we derive an achievable equivocation region based on compress-and-forward. Compress- 
and-forward scheme was proposed in [25] and has been used for the relay network with an 
external eavesdropper in [13], [26]. In our case, as we will see, the fact that the relay and 
the eavesdropper being co-located brings additional advantage to allow for a higher degree of 
compression to be achieved at the relay as compared to the setting in [13]. 

Theorem 1: For a relay network described as p{Y,Yr\X, Xr), with X, X^ being the input 
from the source and the relay respectively, and Yr,Y being the signals received by the relay and 
the destination respectively, the following region of rate pairs {Ri,Re) is achievable. 

Re<Rl< l(X]YYr\Xr 

0<Re< [I (X; YYr\Xr) - I {X; Yr\Xr)]- 



(6) 



u 

where 

I{Xr;Y)>I{Yr;Yr\YXr) (7) 

and the union is taken over: 

p{X)p{Xr)p{Y,Yr\X,Xr)p{Yr\Yr,Xr) (8) 



Proof: See Appendix [Al 
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Remark 2: Compared with the coding scheme presented in [13], the difference is that we 
have Wyner-Ziv coding. Without Wyner-Ziv coding, the constraint ^ in the Theorem would be 

I{Xr;Y) > I{t;Yr\Xr.) (9) 

which is identical to that in [13, Theorem 4 (12)]. In [13], the eavesdropper is external to the 
relay node, and hence only has a noisy copy of X"(/c). In this case, the equivocation over 
multiple blocks would not necessarily be the sum of equivocation over each block. Reference 
[13] worked around this problem by using a compress-and-forward scheme without Wyner-Ziv 
coding. The equivocation over multiple blocks was then lower bounded by proving that given 
the signal received by the eavesdropper and the secret message, the external eavesdropper would 
be able to determine the signals transmitted by the source and the relay via backward decoding 
[13, Appendix D (53)-(55)]. 

In contrast to that in [13], fortunately, in our model, the eavesdropper has perfect knowledge 
of X^(k). This enables us to compute the equivocation of blocks from the equivocation of 
each block. See (I137l) - (|139l) in Appendix El Hence, the Wyner-Ziv coding is used in our setting 
without difficulty. □ 

Remark 3: Theorem \T\ will be useful in Section |V] in finding an achievable rate for one of 
the models (Model 2) that we will describe in the next section. □ 

Remark 4: We can prefix the channel input X with U and apply Theorem [T] to the channel 
p{Y,Yr\U, Xr). The equivocation region then becomes: 

{ Re<Rl< l{U;YYr\X,) 
\ 0<Re< [I {U; YYr\Xr) - I {U; i;|X,)]+ 
for which (|7]) must be fulfilled, and the union is taken over: 

piU, X)p{Xr)p{Y, Yr\X, Xr)p{Yr\Yr, X,) (1 1) 

Clearly, this may potentially enlarge the achievable region given by Theorem [Tl □ 

Having examined the general relay channel with a co-located eavesdropper, we next consider 
two special cases of it for which stronger results can be derived. 
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Fig. 3. Relay Channel with Orthogonal Components Model 1: Orthogonal Source to Relay Link 

III. Two Special Cases of the General Model: Relay Networks with 

Orthogonal Components 

The two models of the relay network with orthogonal components are depicted in Figures [3] 
and m respectively. Figure [3] shows Model 1. In this model, the relay and the source communicate 
with the destination via a multiple access channel, with its input being XD,Xr and output being 
Y. The source and the relay communicate via a channel orthogonal to the channel used by the 
source and the relay to transmit to the destination. The input and the output of this channel are 
denoted by and Yr respectively. Thus, the overall channel description is: 

p {Y, Yr\XR, Xn, Xr) = p {Y\Xd, X^) p X,) (12) 

The capacity of this network without secrecy constraints was found in [22]. 
The Gaussian case of Model 1 is defined as [22]: 

Yr = aXR + Zi, Y = bXr + XD + Z (13) 

where Zi and Z are independent zero mean real Gaussian random variables each with variance 
N. a and b are channel gains. The transmit power constraints on the source and the relay are 
given by: 

-in 1 " 

- E {E[xh] + mh]) <p^ - T^mu < ip (14) 

Figure |4] shows Model 2. In Model 2, the source communicates with the relay and the 
destination via a broadcast channel, and the relay communicates with the destination via a 
separate (orthogonal) link. Thus, the channel is described by: 

p{YD,YR,Yr\X,X,) = p{YD\X)p{Yr\X,Xr,YD)p{YR\Xr) (15) 
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Fig. 4. Relay Channel with Orthogonal Components Model 2: Orthogonal Relay to Destination Link 

When there are no secrecy constraints, the Gaussian case of Model 2 was considered in [23]. 
The capacity of this channel remains an open problem except for some special cases given in 
[23]. 

The class of channels for which we will be able to derive an upper bound on the secrecy rate, 
is described by: 

p{YD.Yn,Y,.\X,X,) = p{Yn\X)p{Y,\X,YMYR\X,) (16) 

Observe that such a channel is a special case of ([T5l) since X^ is dropped from the condition 
term of Y^ in (fTSl) . 

We will discuss two channels that fall into the class defined by ([T6l) : (i) the Gaussian case of 
Model 2, (ii) the Gaussian Cover- Kim deterministic relay channel [24]. 
The Gaussian case of Model 2 is defined as: 

Yjj = X -\- Z £) Yf, = ctX -\- Zj. 

(17) 

Yr = bXr + Zr 

where Z^jZ^^Zr are independent zero-mean Gaussian random variables with unit variance, a 
and b are channel gains. The transmit power of the source and the relay are constrained by: 

-| n 1 " 

-T.E[Xl,]<Pr, -T.E[Xn<P (18) 

i=i ^ i=i 

The Gaussian Cover-Kim deterministic relay channel is depicted in Figure [5l The received 
signals at the destination and at the relay are given by: 

Yd=X + Z, Yr = aX-Z (19) 

where a is the channel gain and Z is a zero mean Gaussian random variable with unit variance. 
Notice that the random variables representing the noise components have a correlation p = —1. 
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Fig. 5. The Gaussian Cover-Kim Deterministic Relay channel 



Between the relay and the destination, there is a separate noiseless link with rate Rq. The 
destination receives side information from the relay via this link in addition to Yd which it 
receives from the source. The transmission power of the source is constrained to be: 



n 



(20) 



i=l 



In the following sections, we first derive the equivocation capacity region of Model 1. We 
then derive the achievable equivocation region for Model 2 using the results from Section UIl 
Finally, we derive the upper bound for the secrecy rate for the class of Model 2 defined in (fT6l) 
and specialize it to the Gaussian case and the Cover-Kim channel. 



IV. Equivocation Capacity Region for Model 1 
Theorem 2: The equivocation capacity region of Model 1 is given by 

{Ri,Re) : 

U ] < i?i < min I / (Xd, X,- Y) , / [Xr- y;|X,) + / {Xr>; Y\X,) 
0<Re< min{/ (Xd; Y\Xr) , 



p{Xr) 
p{Xo\Xr) 
P{XR\Xr) 



(21) 



Proof: See Appendix |Bl ■ 
Remark 5: Theorem [21 is proved by specializing the results from [21]. The achievable scheme 
is based on partial-decode-and-forward. This entails that the relay decodes the information 
transmitted via X^j. The scheme is outlined next for the sake of completeness: 
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Denote the codebook used by the relay and the source as Cr and {CrXd} respectively. The 
codeword in Cr is denoted by X". The codeword in {Cji,Cj:)} is denoted by {X]^,X]^}, which 
are to be transmitted via and respectively. 

The codebooks are generated as follows: 2"™™'t^(^'^'^)'^(^«'^'-l"^'-)j^ codewords are sampled in an 
i.i.d. fashion from p{Xr) to form Cr- For each X" in Cr, 2"^('^o;>'|Xr) codewords are sampled in an 
i.i.d. fashion from p{XD\Xr) and are included in Co- For each X^^ in Cr, 2"™'^i^(^-;^)'^(^«'^'-l^-)i 
codewords are sampled in an i.i.d. fashion from p{Xji\Xr) and are included in Cr. 

The transmission is divided into N blocks, each composed of n channel uses. The messages 
transmitted by the source during the kth block is denoted by {WD{k),WR{k)}. Woik) corre- 
sponds to the secret part of the message. The cardinality of {Woik)} is smaller than 2"^(^o;>'l^r)_ 
The cardinality of {WR^k)} is smaller than 2"™°^^(^' The signals received and 
transmitted by the relay during the kth block are denoted by Y^{k) and X"(A;) respectively. 
The relay decodes WR{k) from Y^{k) using X^(k) as the side information. Xl^(k) is chosen 
by the relay based on WR{k — 1), which the relay decodes from Y^{k — 1). The source node 
knows WR{k — 1), and hence knows X^{k) before the kth block starts. It locates the part of the 
codebook Cr which is generated according to X^ (k) and transmits the message WR{k) using this 
part of the codebook. The source also locates the part of the codebook Cd which is generated 
according to X^(k) and transmits the message Woik) using this part of the codebook. The 
destination can successfully decode WR^k — 1) from Y'^(k), which determines X^(k), due to 
the fact that the cardinality of {WR{k)} is smaller than 2"^(^'^'^). Then it locates the part of the 
codebook in that is generated according to X^{k) and use it to decode Woik) from Y'"'{k). 
This is possible due to the fact that the cardinality of {Wnik)} is smaller than 2"^(^o;>"l^r) □ 

Remark 6: By letting Re = Ri in (f2T)) . we obtain the secrecy capacity of the network given 

by daa]). 

S= max I(Y;XD\Xr) (22) 

p(Xr)p{Xo\Xr) 

= max I [Y] XolXr = Xr) (23) 

p{XD\Xr=Xr) 

It is readily seen that in this case the relay to destination link is not useful. Additionally, when 
Re < Ri, from the coding scheme outlined in Remark [5l the secret information, Wr){k), is 
only mapped to signal transmitted via Xjj, which means the secret information does not pass 
through the relay node at all. These two observations combined lead to the conclusion that the 
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relay-to-destination link is indeed not useful in improving the secrecy rate of the system, and 

that the untrusted relay should not be deployed at all. □ 

A direct extension of the above result can be readily made to the Gaussian channel. Q 
Corollary 1: For the Gaussian relay network described above, the equivocation region is given 

by (EH). 



Ri < min <^ C 



0<t-,P<l I 0<i?e<Cf^^f^j Re<Rl 

Proof: The proof is the same as in reference [22, Section III]. The three terms: I{X£), Xr;Y), 
I{Xji; Yr\Xr), I{X£,; Y\Xj) are maximized simultaneously when Xr^Xo.Xn are chosen to be 
zero mean and jointly Gaussian with the following parameters: Var[Xr] = 7P, V^ar[Xij] = 
(1 - v)P, Var[XD\ = vP, E[XrXD] = pP^, E[XrXd] = 0. ■ 



V. An Achievable Region for Model 2 

In this section, we present the achievable equivocation rate region for Model 2. 

Theorem 3: For Model 2 defined by (fT5l) . an achievable equivocation rate region is given by: 



Re<Rl< I [X;YDYr\XrYRj . ^^^^ 



0<Re< [I[X;YDYr\XrYR)-I{X;Yr\Xr)]^ 
where 

/(X,; Yn) > lit; YrlYoYRXr) (26) 

and the union is taken over: 

p{X)p{Xr)p{YD\X)p{Yr\X,Xr,Yn)p{YR\Xr)p{Yr\Yr,Xr) (27) 



Proof: We use Theorem [T] In particular, region (l25l) follows from ^ by letting Y = 
{Yd, Yr} and using the following two Markov chains (l28l) and (|29|) . (l29l) follows from the fact 
that {Xr, Yr} is independent from Yd as shown by (l27l) . 

X - X, - Ffi (28) 

'Proofs follow by replacing entropy with differential entropy whenever necessary. 
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Xr - Yr - Yd (29) 
It then follows from ^ and ^ that /(X^; YrYd) = /(X^; Yr) and 

/(X; YnYotlXr) = I{X; YoYrlXrYn) (30) 

■ 

Next, we apply Theorem [3] to the Gaussian case, which is defined by (fTT)) . 
Corollary 2: For the Gaussian relay network with orthogonal components defined by (fTTI) . 
the following rate region is achievable. 

0<R,<R^<c(p+^ 




(31) 



where 



Proo/.- Region (EB follows from letting X ~ AfiO, p),Xr ~ 7\A(0, Pr), % = Yr + Zq, Zq ~ 
A/^(0,o'q), and is independent from all the other variables. Substituting the distribution of 
X, Xj., Fr, ^r, Yfi into (|26l) . we find that we need 

^« > b-PAp+l) ^^^^ 
It is clear from (|3T1) that to make the region as large as possible, ctq should be as small as 
possible, and (|32l) ensures this. ■ 
Remark 7: Suppose a > 1. Without the channel between relay and destination, we have a 
wiretap channel where the eavesdropper has a better channel. Hence, the secrecy capacity is 
zero [27]. We also know that a non-zero secrecy rate cannot be achieved with decode-and- 
forward. However, if the relay to destination gain, 6, is large enough, a non-zero secrecy rate 
can be achieved with compress-and-forward, as can be seen from (|3T1) . This is an example 
where the relay-to-destination link helps to achieve a non-zero secrecy rate when the relay and 
the eavesdropper are co-located. Thus, the untrusted relay is useful and should he cooperated 
with. □ 
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Yr = aX + Zr 
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Yn = b(3{aX + Z.,) + Zr 
Yo^X + Zd 



Fig. 6. The Equivalent Wiretap Channel of Model 2 using Amplify-and-forward Relaying 

Remark 8: The scheme we present here differs from the noise forwarding scheme of [13] 
where the relay transmits noise that is independent from its received signal. By contrast, in this 
work, the signal transmitted by the relay is computed from its received signal. □ 

Remark 9: The amplify and forward scheme can also be used at the relay. Let p be the average 
transmission power of the source node. Then, in this case, the signal transmitted by the relay at 
the ith channel use is given by 



Note that in (|34|) . we force Xr^i to depend on the signal received in the previous channel use 
Yr^i-i in order to preserve the causality of the relay function as defined in ([T]). However, because 
the channel between the relay and the destination is orthogonal to the one between the source 
and the destination, the fact that the signals received via is delayed by one channel use 
compared to those received via does not make any difference to the destination. Therefore, 
it is safe to write = PYr and omit the subscript i. 

The relay network is therefore equivalent to a Gaussian wiretap channel as shown in Figure [6l 
The achievable secrecy rate is computed from [/(X; YflYo) — I{X;Yr)]^ [3] for a Gaussian 
distribution for X: X ~ J\f{0,p) and when maximized over p, the secrecy rate is given by: 




where f3 




(34) 



V a'^p + 1 



where for (3 defined in (|34l) . ^ is given by 




(35) 
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Fig. 7. Effect of Source Power Control 

Observe that amplify-and-forward can also achieve a non-zero secrecy rate given a large enough 
b. However, comparing it to (|3T1) . we find that the secrecy rate given by amplify-and-forward is 
strictly smaller than the secrecy rate achievable by compress-and-forward. □ 

Remark 10: When there are no secrecy constraints, for compress-and-forward/amplify-and- 
forward, the source should always transmit at maximum power. However, when there are secrecy 
constraints, for compress-and-forward/amplify-and-forward, the source may not transmit at max- 
imum power. This can be shown as follows. 

We first look at the case where there are no secrecy constraints. The rate for compress-and- 
forward follows from the maximum possible value of Ri in Corollary [2l which is 



where (Jq is given by (|32|) . Recall that p is the average transmission power of the source node. 
Hence we only need to show that 



is a monotonic function of p which is proved in Appendix O Hence to maximize Ri we should 
choose p = Pi. 




(37) 




(38) 
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The rate for amplify-and-forward is derived by ignoring the eavesdropper in Figure [6l The 
achievable rate is I{X; YrYd), which, using the Gaussian input distribution for X, equals 

max -log(l + (1 + (39) 



where ^ is given by (1361) . To prove that (1391) is maximized at p = Pi, it is sufficient to prove 
that is a monotonically increasing function of p, which can be shown by rewriting ^p as: 

(40) 



1 -I- ^ 1 I a'^P+i l+^-TT 9 

^ + pH^ 1 + -fT^ + 

When there are secrecy constraints, the secrecy rate is not necessarily maximized at p = Pi. 
This can be observed in particular when h (the relay to destination link gain) is small. In this 
case, for compress-and-forward, as shown in (l32l) . the quantization noise ctq will increase more 
rapidly with source power p. Similarly, for amplify-and-forward, the ^ in (l36l) will decrease more 
rapidly with source power p. This, along with the negative term —C{a^p) present in (|3T] ) (|35] ). 
may offset the benefits of having a larger source power p. This phenomenon is demonstrated 
numerically in Figure |7l where the source-to-relay channel gain a = 1.2. Both compress-and- 
forward and amplify-and-forward can achieve a larger secrecy rate when power control is used 
at the source. Moreover, compared to compress-and-forward, amplify-and-forward benefits more 
from judicious power allocation at the source. □ 

VI. Upper Bound for the Secrecy Rate Of A Special Class of Model 2 
A. The Enhanced Channel 

In this section, we describe the general methodology that we use to derive the upper bound. 
Our upper bound involves introducing a second eavesdropper. The focus of this section is to 
investigate the sufficient condition such that doing so will not decrease the secrecy capacity of 
the channel. In Section IVI-B[ this will be useful in finding the upper bound for the secrecy rate 
for a class of channels conforming to Model 2. 

We focus on the case there is no feedback from the relay's output Xj. to its input Yj., which 
means the conditional probability distribution of the channel should have the following form: 

p[Yr\X)p{Y\X,Xr,Y,) (41) 

Note that due to the absence of feedback, we drop the term Xj. from the conditioning of Yj.. 
The reason that we choose this distribution to study will be clear shortly. 
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Fig. 8. The Relay-Eavesdropper Separation Argument 

Deriving the outer bound entails a "relay-eavesdropper separation" argument. In other words, 
the net effect of this argument is to change the eavesdropper that is co-located with relay node, 
to an eavesdropper that is external to the relay node. Illustrated in Figure [8l this means: 

1) We add a second eavesdropper to the relay network, who sees a channel that is statistically 
equivalent to the channel seen by the relay node. Let the signal received by this second 
eavesdropper be Y^. That is, we have: 

p{Yr\X,Xr)=p{Y,\X,Xr) (42) 

2) We remove the first eavesdropper. 

The reader, at this point, rightfully should question the validity of step 1). This is because, 
as mentioned earlier, introducing a second eavesdropper, can decrease secrecy rate in general, 
even if the second eavesdropper observes a statistically equivalent channel as in (|42|) . This is 
because the second eavesdropper may be able to hear the transmission signal Xj. of the first 
eavesdropper, and these two eavesdroppers can potentially cooperate. An example is provided 
in Appendix O to demonstrate this phenomenon. 
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We next show that, for the channel model in (|4TI) . introducing a second eavesdropper, if done 
with care, will not alter the secrecy capacity of the system. In particular, let the received signal 
of the "second" eavesdropper be defined as follows: 

p{Yr\X)p{Y,Y,\X,Xr,Yr) (43) 

s.t. p{Y,\X,Xr) = piY,\X) = p{Yr\X) (44) 

Note that the second equality in (l44l) is (l42l) specialized for (|4TI) . We reiterate that though a Y^ 
conforming to these conditions may not exist for any arbitrary relay network, for the Gaussian 
relay network models we are interested in, such a Ye can be found, as will be seen in the sequel 
(See (El, ^M, (fTT3l) ). 

For this choice of Yg, we have the following theorem: 

Theorem 4: For the relay channel defined by (|4T)) . (I43l)- (l44)) are sufficient for the secrecy 
capacity of the channel after introducing the second eavesdropper to remain identical to the 
secrecy capacity of the original channel. 

Proof: Due to the addition of the second eavesdropper, we know that the secrecy capacity 
of the new channel < the secrecy capacity of the original channel. Therefore, we only need to 
show that the secrecy capacity of the new channel > the secrecy capacity of the original channel. 

We use q to denote any distribution related to the new channel, and p for any distribution 
related to the original channel. Suppose the new channel uses the exact same coding scheme and 
the same message set {W} as the original channel. Then we can make the following statements: 

1) Suppose W can be reliably received by the destination at a rate of Re in the original 
channel. Then it must be reliably received by the receiver at the same rate in the new 
channel as well, because these two channels share the same coding scheme and the same 
channel statistics. 

2) The transmitted message W is still secret from the first eavesdropper co-located with the 
relay, since we are using the exact same coding scheme of the original channel. 

3) We next show that H{W\Y^) of the new channel equals H{W\Y^') of the original channel. 
To do that, it is sufficient to prove that g(Fg"|iy) of the new channel equals p{Y^\W) of 
the original channel, as we show nextj^ 

^ It is understood in the case of continuous random variable, the sum should be replaced by integral. In fact, both of them 
can be expressed as integral by defining the measure properly. 
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First we state two Markov chains, which are proved in Appendix |El 
We have: 



(45) 



n 

' Y,Y[p(Yr,,\x-, r-i) p {x-\w) (50) 



q{Y:\W) 

=Y.qiy:\xn<imw) (46) 
= E n ? {YeM^ y::,') q {x^m (4?) 

X" i=l 

( "I 

= En9(^e.|X,)g(X"|iy) (48) 

(c). 

X" i=l 

= J2p{Y;^\x-)pix-\w) (51) 

X" 

=p(niW^) (52) 

Here step (a) follows from the Markov chain Fe,j — Xi — Xji^Fg J^. Step (6) follows from 
the fact that these two channels share the same coding scheme, p{X^\W) = q{X^\W), 
and the constraint we placed on the marginal distribution q{Ye\X) = p{Yr\X). Step (c) 
follows the Markov chain Fj. j — Xi — X]i[Y^J^. 

The fact that introducing an eavesdropper does not reduce the secrecy capacity can then be seen 

from the following relationship: 

lim -/ (W; K"Xr) > lim -I (W; K") = lim -/ (W; Y") > (53) 

If ]irn ^I{W; Y^X"^) = 0, then {W;YJ') = 0. Therefore, for a given coding scheme, if 
W is kept secret from the eavesdropper at the relay, it is also kept secret from the newly 
introduced eavesdropper. Hence any secrecy rate achievable in the original channel is achievable 
after introducing the second eavesdropper. This means the secrecy capacity remains the same. 



21 



Theorem |4] shows that if the relay is not self -interfering, adding an eavesdropper as described in 
step 1 will not incur any loss in secrecy rate. This, along will step 2, will result in an "enhanced" 
channel whose secrecy rate is an upper bound to that of the original channel. 
Remark 11: Actually, for the channel model in (|4TI) . we have 

XI!) = HiWlY;") (54) 

This means the secrecy capacity of the channel model in can be computed via lim„_^oo ^H(W\Y^) 
instead. This is proved in Appendix IBn 

Remark 12: Note that the conditional probability distribution of the relay channel p{Y, Yr\X, Xr) 
is left intact. The benefit of the separation argument is that we have freedom in choosing Ye, as 
long as it conforms to (l43l) and (l44l) . Choosing Ye properly allows us to tighten the bound.D 

B. Upper Bound for a Special Class of Model 2 

We next use the result we derived in Section IVI-AI to upper bound the secrecy rate of a class 
of relay channels. This class, as we mentioned earlier, is given by (fT6l) . which can be specialized 
from (|4TI) . Equation (|43l) becomes: 

p{Yd, Yr, Yr, Ye\X, Xr) = p{Yn\X)p{Yr\X , Yn)p{YR\Xr)p{Ye\X, Yn, Y^) (55) 
p{Yr\X) = p{Ye\X) (56) 

Definition 1: Define V as the set of joint probability distribution functions of Yd, Yr, Yr, Ye, X, Xr 
such that ([55]) and are fulfilled. 
With this definition, we have the following theorem: 

Theorem 5: For the relay channel defined in (fT6l) . where the relay is the eavesdropper, the 
secrecy rate Re is upper bounded by 

f I{X;Yn\Yr) 
max min < 
^'(^'^'■) [ / (Xr-, Yr) + mm/ (X; YolYe) 

Proof: The first term can be obtained by specializing the result from [21]. Reference [21, 
version 7,(13)] claims for a general relay p{Y,Yr\X, Xr), the secrecy rate is upper bounded by 

Re<I{X;Y\Yr,Xr) (58) 
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Specializing it to our channel, which means replacing Y with Yd, Yr, we have 

I{X;YD,YR\Yr,Xr) (59) 

</ (X; F^|y;, X,) + / (X; Foil;., X„ F^) (60) 

From (1551) . X — {Yr, X^} — is a Markov chain. Hence (|60l ) equals: 

I{X;YD\Yr,Xr,Yi,) (61) 

=/i (Fz^lF,, X„ Yr) - h {YnlYr, X„ X, F^) (62) 

</i (Fz^li;) - h {YolYr, Xr, X, Yr) (63) 

From (|55l) . Yd - {X, Fr} - {X^, Yr} is a Markov chain. Hence (l63]) equals 

/i (FDin) - h {Yn\X, Yr) = I (X; Yd^) (64) 

Hence we have proved the first term. 

Next, we proceed to bound the second term: 

h{w\y:) 

<I{W;YSY^\Y:'')+ne, (65) 

=/ {W; Y^\Y:) + I {W; Y^lY^Y^) + ne, (66) 

</ (W^X"; F^|y,") + / {WXl^; Y^lY^'YS) + nsi (67) 

=/ (X"; YJ^\Y^) + / {WX^- Y^IY^'YS) + ne^ (68) 

n 

=1 (X"; YS\Y:) + h (F^ln^F^) - E ^ + (^9) 

i=l 

(X- + h {Y-\Y,-YS) - E h {YR,\Xr,) + ne, (70) 

i=l 

n 

</ (X"; r^|F,") + Y.{h {Yr,,) - /z (Fii,|X,,0) + nsi (71) 

n n 

=h {YS\Y:) - E h {YnAY:X-Yi,~,') + E / {Xr,; Yr,) + nei (72) 

i=l i=l 

,1 n n 

= h {YS\Y:) -Y^h {YD,i\Y,,,Xi) + {Xr,i; Yr,,) + ne, (73) 

n n 

< E / {X^; YD,i\ye,i) + E ^ (^-,^; ^^,0 + (74) 



i=l 
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=nl {X- Yd\Y,, Q)+I (Xr; Yr\Q) + ne, (75) 

=nh (YnlYe, Q) - nh {Yd\Y,, X) + nh {Yr\Q) - nh {Yr\X,) + ne^ (76) 

<nh {Yd\Y,) - nh {Yd\Y,, X) + nh (Yr) - nh (F,j|X,) + ne^ {11) 

=nl {X-Yd\Y,) + nl {Xr-.YR) + nei (78) 

Here step (a) follows from Fano's inequality. Step (h) follows from the relay destination link 
being orthogonal to the rest part of the channel. Step (c) follows from the fact that the relay is 
not interfering the second eavesdropper. Therefore given {Ye^i^Xi}, the signals {Yej,j > i} do 
not provide further information about Yo^i- ■ 
Remark 13: Another upper bound that can be obtained is 

I{X;YDYr\Y,) (79) 

which is proved in Appendix [Gl and it can be further tightened by choosing Yg. However, as 
shown below this upper bound does not improve the first term in (l57l) . 

I{X;YMY,) 

=1 {X;YnYrY,) - I {X;Y,) (80) 

=/ (X; YoYrY,) - I (X; Yr) (81) 

>I (X-YnYr) - I (X-Yr) (82) 

=I{X;Yn\Yr) (83) 
Step (6) follows from p{Yr\X) =p{Ye\X). □ 

C. The Gaussian Case of Model 2 

Using Theorem [51 we now evaluate the upper bound for the Gaussian channel. 
Corollary 3: For the Gaussian case of Model 2, which has independent noise components, 
the upper bound on secrecy rate is: 

min [c{b'P) + [C{P) - C{a'P)]\ C {y^) } ^^4) 
Proof: First we notice that (|57l) is upper bounded by: 



mm 



max I(X;YD\Yr) 

p{X,Xr) 

max I (Xr', Yf/) + min max I (X: YnlK 



(85) 



24 




b(channel gain between relay and destination) 

Fig. 9. Secrecy Rate of the Gaussian Ortliogonal Relay Channel 

Since p{Y(,\X) = p(Yr\X), we define a Gaussian random variable N^. such that 

Ye = aX + Ne (86) 

Then the set V can be re-parametrized with the correlation between N^, Nr and the correlation 
between N^^Nd. For a given correlation, it is known that I(X;Y£)\Ye) and I(X;YrYu\Ye) are 
both maximized with Gaussian distribution [5, Appendix II]. The first term in (|85l) then becomes 
I log2 (l + Y4^p) • To obtain the second term inside the minimum, we choose Ne/a? = N^+N', 
with TV', Nu being independent if a < 1, otherwise we choose Nd = Ne/a? + A^', with A^', A'e 
being independent, where A^' is a zero mean Gaussian random variable with appropriate variance. 

■ 

In Figure |9l we compare the upper bound with the achievable rates for the Gaussian case 
of Model 2. We fix the source-to-relay channel gain a = 1, and vary the relay-to-destination 
channel gain 6. As 6 — oo, we observe that the upper bound becomes tight. As 6 — 0, the upper 
bound decreases. This improvement is due to the first term in Corollary [3l 
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VII. The Cover-Kim Deterministic Relay Channel 

In this section, we investigate the Cover- Kim deterministic relay channel of [24], [28] whose 
capacity is established therein. The channel was defined in Figure \5\ in Section Hill 
For the achievable secrecy rate, we have the next theorem. 

Theorem 6: For the Gaussian Cover-Kim deterministic channel, the following secrecy rate is 
achievable: 



Ro + C{P)-C[a^P)\ (87) 

Proof: Let C be a random code book with 2L"h+^(^)-^("'^)]^J2L"^("'^)J codewords 
sampled from an i.i.d. Gaussian distribution with zero mean and variance P. These codewords 
are randomly partitioned into 2L"[^o+'^(^)~'^("^^)] J bins of equal size. The bin index of the 
transmitted codeword is determined by the message W. The actual transmitted codeword is then 
selected randomly from this bin according to a uniform distribution. The relay uses either hash- 
and-forward or compress-and- forward as described in [24]. Let _E'[Pe|C] be the average error 
probability over the codebook ensemble {C} that the destination could not correctly determine 
X", hence W, from Yg and side information provided by the relay. It was proved in [24] that 

\imn-,ooE[Pe\C]=0. 

Since each bin is a Gaussian codebook by itself whose rate is below the AWGN channel 
capacity between the source and the relay, the relay node can determine X" given W and 
with high probability using jointly typical decoding. Therefore, from Fano's inequality, we have 
H {X''\WY^C) < HE I. Thus: 

H {W\Y,X) =H (X^'WIY^C) - H (X"|iyi;"C) (88) 

>H (X'^WIY^C) - nei (89) 

=H (X"|y;^C) + H (ly |X"17C) - nei (90) 

=i7(X"|y,"C) -nei (91) 

=if (X"|C) - J(X";y,"|C) -nei (92) 

>/7(X"|C) - J(X";y,") -nei (93) 

n 

>H (X"|C) - ^ / (X,; 1;,,) - nsi (94) 

i=l 
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Since each code word is selected with equal probability, we have 

lim -H (X'^IC) = CiP) + Ro (95) 

Also, I {Xi;Yr^i) = C{q^P). Substituting this and ([95]) into ([941), dividing it by n and taking 
the limit n ^ oo, we have (|87l) . which equals lim„_^oo ^H{W\C). Therefore lim„„,oo -Ef-PelC] + 
^I{W;Y^\C) =0. Since both terms inside the limit are non-negative, this proves the existence 
of at least one codebook with a rate of [Rq + C (P) — C {a^P)]^ such that both terms are 
arbitrarily small. Hence we have proved the theorem. ■ 
Theorem 7: The secrecy rate of the Gaussian Cover- Kim deterministic channel is upper bounded 

by 

Rq^\C[P)-C{o?p)^ (96) 

Proof: We use Theorem |4] to separate the eavesdropper and the relay. Let be the signal 
received by the eavesdropper such that (l55l) and (l56l) are met. Then, we have: 

E {w\y:) 

<I (W; F^X^lFe") + nei (97) 

=1 {W; y^|y,") + / {W; XllY^^YS) + ne^ (98) 

<I{W;Y^\Y:') + H{X^)+nei (99) 

</(iyX";F^|y,") + if(X^)+n£i (100) 

=J(X";F^|F,") + /7(X^)+n£i (101) 

n 

=h (Y^IY:) -Y^h {YoAY:X-Yi,:,') + H {X^ + (102) 

1=1 

^^h {Y^\Y:) -j^h {Yo,\Y,,X,) + H {XI) + ner (103) 

i=l 

n 

<J2l {X,; YD,^e,i) + H TO) + ne, (104) 

i=l 

=nI{X;YD\Y,,Q) + H{X'^) + nei (105) 

=nh {YolYe, Q) - nh {Yd\Y,, X) + H {XI) + nsi (106) 

<nh {YD\Ye) - nh (YdII;, X) + H {X^) + nei (107) 

=nl {X-Yd\Y,) + H {XI) + nsi (108) 
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<nI{X-YD\Ye)+nRo + ne2 (109) 

Step (a) follows from Fano's inequality. Step (b) follows from the fact that the relay is not 
interfering with, i.e., heard by the (second) eavesdropper. Therefore, given {Ye^i,Xi}, signals 
{Yej,j > i} will not provide more information about Yd j. 
The bound is further tightened by choosing Y^. properly. 

1) If a > 1, then 

Ye = aX + Z (110) 

Yd = X + - + Z' (111) 

a 

Yr = X---Z' (112) 
a 

Z' is a zero mean Gaussian random variable with variance |1 — ^|, and Z' is independent 
from Z. 

2) If a < 1, then 

Ye = X + Z+Z' (113) 

Z' is a zero mean Gaussian random variable with variance |1 — ^|, and Z' is independent 
from Z. 

Substituting these choices of Y^ into (11091 ), we get (|96l ). ■ 
Remark 14: Inspecting (l87l) and (l96l) . we see that the upper bound and the achievable rate 

coincide when a < 1. Hence, for a < 1, i.e, when the source to destination link is not worse 

than the source to the relay link, the secrecy capacity is achieved by compress-and-forward. □ 
Remark 15: The secrecy capacity can exceed the direct link capacity if Rq > C{P). This is 

a benefit of the correlation of the noises corrupting the links from the source. If the noises are 

independent, the secrecy capacity cannot exceed C{P), as proved next: 
Observation 1: If the relay channel has the property: 

p{YR,YD,Yr\X,Xr)=p{Yn\Xr)p{Yr\X)p{YD\X) (114) 

Then i?e < / {X- Yd) 
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Fig. 10. Secrecy rate for the Gaussian Cover-Kim Deterministic Relay channel 



Proof: From (|57] ), we have 

Re < I {X-YD\Xr) (115) 
= h{YD\Xr)-h{YD\Xr,X) (116) 

= h{Yr>\Yr)-h{YD\X) (117) 
<I{X-Yd) (118) 

where in (II 171 ) we use the Markov chain Yo — X — Yr. ■ 

□ 

We conclude this section by presenting Figure [10] which shows the upper bound and the 
achievable rate for Rq = 0.5 bits/channel use and P = 1. As expected, the two meet for a < 1, 
yielding the secrecy capacity. 

VIII. Conclusion 

In this paper, we have considered the relay channel with an untrusted relay that is treated 
as an eavesdropper. In particular, we focused on two relay channel models with orthogonal 
components. For the first model, we have found the capacity-equivocation region and proved 
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that the relay-destination link does not help in increasing secrecy rate, and therefore the untrusted 
relay should not be deployed if perfect secrecy is desired. In contrast, for the second model, we 
have found an achievable secrecy rate which calls relay's cooperation and improves the secrecy 
rate as compared to treating it simply as an eavesdropping node. Thus, we conclude that, for this 
model, the untrusted relay may help the source and the destination to communicate despite being 
subjected to the secrecy constraint, and that cooperation with the untrusted relay is beneficial. 

We have provided a channel transformation that separates the relay and the eavesdropper to 
upper bound the secrecy rate for a special class of untrusted relay channels. We have found this 
approach to be useful in upper bounding the secrecy rate for two cases: For the Gaussian relay 
channel with an orthogonal relay-destination link, this new approach yields a computable bound 
that tightens previously known bounds. For the Gaussian Cover-Kim deterministic relay channel, 
we have shown that this approach finds the secrecy capacity when the source-destination link is 
not worse than the source-relay link. 

Since the first example demonstrating the potential benefit of cooperating with an untrusted 
relay [29], there has been recent growing interest in communication models with untrusted relays. 
Notable recent developments include work on the multiple access channel with generalized 
feedback [30] and relay broadcast channel [31], [32], where, in addition to the secret message 
considered in this work, the untrusted relay node has its own secret message. The role of untrusted 
relay is examined in bi-directional communication in [33], [34], where the relay node in a two- 
way relay network is untrusted. A case for the communication scenario with multiple untrusted 
relay nodes is recently presented in [35], [36], where the source and the destination can only 
communicate via a chain of untrusted relay nodes. All these works, like this paper we are about 
to conclude, speaks to the merit of cooperative communication even with untrusted partners, and 
that cooperation and secrecy can go hand in hand. 

Appendix A 
Proof of Theorem H] 

The achievable scheme of Theorem [T] is a combination of stochastic encoding at the source 
node and compress-and-forward at the relay node. The compress-and-forward relaying scheme 
is the same one described in [25]. The achievable scheme involves N blocks of channel uses. 
Each block is composed of n channel uses. 
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A. Codebook Generation 

1) The Codebook of the Source Node: The source uses a codebook composed of i.i.d. 
sequences sampled from the distribution p{X). Each codeword has n components. In order 
to confuse the relay/eavesdropper, the codebook is further partitioned randomly to bins. Suppose 
there are 2"^ bins. Each bin contains 2"^ codewords. B is chosen such that: 

nB = [nI{X;Yr\Xr)\ (119) 

The reason behind this choice will be clear shortly. Each codeword is hence indexed by the 
label {b,c}, where c is the bin index and b indexes the codeword within the bin. The rate of the 
codebook is given by 

lim - log, = \im B + C = I (X; YJXr) + lim C (120) 

n— >oo fi n^oo n— ♦oo 

2 ) The Codebook of the Relay [25 ]: 

i) The signal transmitted by the relay is from a codebook composed of i.i.d. sequences 
sampled from the distribution p{Xr). Each codeword has n components and is denoted by 
X"{d). The codebook has 2"^ codewords. 

ii) For each d, we generate 2"^ codewords, each with n components, denoted by Yr{e\d). 
The ith component of the codeword is drawn from p{Yr\Xr = X"j((i)) in an i.i.d. fashion. 

iii) For each d, we randomly bin the label e into 2"^ bins and label each bin with a d according 
to uniform distribution. This random binning is used for Wyner-Ziv coding. 

We use C to denote the random codebooks generated for the source and the relay. 

B. Stochastic Encoder at the Source Node 

The codeword transmitted as the kth block is indexed by label bk,Ck, where Ck is the bin 
index and bk indexes the codeword within the bin. Let W{k) be the message transmitted at the 
kth block. Recall that Ri is the rate of the message W{k). Hence Ri = log2 \ W{k)\/n. The 
messages are mapped to the codewords as follows. 

i) If Ri > C, Ck is the bin index determined by W{k). The codewords in bin Ck are partitioned 
into 2"(^i~^) subsets. The subset is chosen according to the unmapped part of W{k). Then 
bk is selected from this chosen subset according to a uniform distribution. 
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ii) If Ri < C, Cfc is still determined by W{k). is randomly chosen from group according 
to a uniform distribution. 
For this mapping, we observe that the cardinality of Ck is 2" ™™^^^''^^^. 

Only A^ — 1 messages W{1)...W{N — 1) are transmitted over blocks. During the last block, 
the relay and the source agrees that the source will send message 1. 

C. Compress-and-forward at the relay [25] 

During the A;th block, the relay node first compresses to Yjl^{ek\dk). is indexed by 

two labels: and dk- dk is chosen to be the label that corresponds to X^(k). Hence a different 
set of 1"", of size 2"^, is used for compression depending on the value of X^{k). The label Cfc 
is chosen to be the first element in the following set: 

{e : y;" (e|4) , {k) , (k) are jointly typical} (121) 

If the set is empty, = 1. The size of the codebook Y^{e\dk) should be sufficiently large for 
the set to be nonempty, which requires E > Yr\Xr) [25]. 

Label is transmitted during the A;+lst block. At this time the destination has received Y"-{k), 
and can decode X^(k). Since {Y'^(k), X^(k)} provide side information to the destination about 
Cfc, Cfc can be compressed further before transmission. This is done via Wyner-Ziv coding. Recall 
that the set {e, d = d^} is randomly binned. The size of each bin should be chosen such that the 
destination can decode ek, and hence determine Y^{k) from this bin from the side information 
{F"(A;),X;?(A;)}. This requires /(Y;; Y\Xr) > E-D. Only the bin index is transmitted. Recall 
that each bin is labeled with d. Hence this determines d^+i, which determines X"(A; + 1). 

Remark 16: One important aspect of this coding scheme is that the signals transmitted by 
the relay during different blocks {X^(k),k = 1...N — 1} are correlated. This is because, as 
described in the coding scheme at the relay, each X"(A;) is determined from YjP' {ek-i\dk-i) , 
which is shown in (|12H) to be related to X^(k — 1). Because of the self interference at the relay, 
the signals received by the relay during different blocks {Y^{k), k = 1...N — 1} are correlated 
as well. However, Y^{k) is correlated with past Y^{p),p < k only through X"(/c). This property 
will be useful in bounding the equivocation rate. □ 
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D. Decoder at the Destination 

Recall that the short hand W{a,...,h) stands for the set {W{a),W{a + I), ...,W{h)}. The 
short hand W stands for the set {W{1), W{2), W{a)}. 

The destination first decodes X^^^^^\ The decoding at the k\h. block happens as: It first 
decodes X^{k) from Y"'{k). For this, we require D < I{Xr]Y). It then determines rf^ from 
X^{k), which determines the bin that contains Ck-i- It next determines ek~i by finding the label 
e in this bin such that Y^{e\dk^i) is joint typical with {Y"-{k — l),X^{k — 1)}. This determines 
y;"(efc„i|4-i)- Finally X"(A; - 1) is decoded from Y''{k - 1), - l),X:;{k - 1). For details, 
the reader is referred to [25, Theorem 6]. 

Let the decoding result be According to error probability analysis in [25], if the rate 

of the codebook of the source meets the condition: 

lim - log2 < I(X] Yr, Y\Xr) (122) 

and the following condition is fulfilled: 

I{Xr; Y) > Yr\YXr) (123) 

then 

lim E[Pr(X"(^-i) ^ X"(^-i)|C)l = (124) 

n— >oo 

The expectation is taken over the random codebook C. 
Combining (|122|) and (11201) . we have: 

liX: YAXr) +\imC < I(X: F IXJ (125) 

n^oo 

The destination then computes W^~^ from since the former is a deterministic 

function of the latter. The average probability of decoding error for is hence upper 

bounded by the average probability of decoding error of X<^-^\ Therefore equation ([T24l) 
implies: 

lim E[Pr(Ty^-^ ^ W^-^\C)\ = (126) 
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E. Equivocation Computation 

Let c^^^ denote {ci, C2, cat^i}. The computation of the equivocation rate starts from the 
following expression: 

H{c^-^\Y^^,X'^^,C) (127) 

=/7(c^-^|F,"(^-i),X,"^,C) (128) 

=H{c''-^\Y:'^''-^\xf''-^\C) (129) 

Here, the first equality follows from the fact that conditioned on X^(A^), the signals F^"(A^) are 
independent from Y^'-'^^~^\ X^^^~^\ c^~^. The second equality is because, as described in the 
relaying scheme above, X"(A^) is a deterministic function of Y^"'{N — 1) and X''^{N — 1). 

To simplify the notation, we omit the C from the conditioning term in the derivation below 
and only mention it when necessary. 

Equation (11291 ) can be reformulated as: 

i/(c^-i|Fj^-i>",Xf-i)") (130) 
+ H (x(^-i)"ii;(^-i)",X(^"i)") (131) 

= -H (x(^-l)"|c^-\r,(^-l)",X(^-l)") + H (x^N-l)ny^{N-l)n^j^iN-l)n^^ (^32) 



_ J ^X^N-l)n. yJA^-l)n^ Xf -1)") (133) 

From the description of the stochastic encoder at the source node, we observe that each block 
X'^(z), z = l,...,X — lis independent from each other. Hence the second term in (11331) can be 
expressed as: 

N-l 

H (x(^-i)") = J2 HiX^'ii)) (134) 

i=l 

From the codebook used by the source node, we observe that 

(X"(i)) = n{B + mm{C, Ri}) (135) 

Therefore 

H (x(^^i)") = {N- l)n{B + min{C, i?i}) (136) 
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The first term in (11331 ) can be upper bounded as: 

H (x(^-i)"|c^-\y,(^-i)",Xf -1)") (137) 

N-l 

= 5^ if (X"(z)|X(^-^)",c^-i,yJ^-i)",X^^-i)") (138) 



i=l 
N-l 



<Y.HiX'^it)\ci,Y;it),X:it)) (139) 



i=l 



Recall that X^(i) is determined by two indexes: bi and q. The label Cj is already on the condition 
term. To determine bi, we notice from (II 191 ) that: 

B < I{X;Yr\Xr) (140) 

With this constraint, for a given codebook C, which is on the condition term implicitly, the 
eavesdropper can estimate b^ from the following set: 

{b : a), 17(2), X;^(2) are jointly typical} (141) 

This set should contain only bk with probability close to 1. From Fano's inequality, we have 
H {X''{i)\ci,Y;'{i),X^{i)) < nei, where si > and lim^^oo = 0. Therefore from (fT37l) - 
( 11391 ). we have: 

H (x^^-i^'^lc^-S <{N- l)ne, (142) 

The third term in (11331) can be upper bounded as follows: 

/ (X(^-I)"; rj^-i)", -1)") (143) 

N-l 

= j2i (x(^-i)"; y;^ (i) , xi^ (z) ir/'-^)", x^^-^)") (i44) 

i=l 
N-l 



+ J (x(^-i)"; X," (i) X,('-i)") (145) 

For compress-and- forward relaying, as explained in the previous section, X"(i) is a deterministic 
function of F/*"^)*^, X^*~^)" . Hence the second term in (11451) is zero. (11451 ) therefore equals: 

N-l 

J2 I (x(^"i)"; f;^ (i) ir/'^i)", x^^^-^)", x;^ (i)) (i46) 

i=l 
N-l 

<Y.h (n" (0 \K (0) - ^ (>7 (0 X," (z) , X(^-^)") (147) 

i=l 



35 



From the coding scheme described in Section lA-Cl we observe Y^(i) depends on {FJ* 

X"(^ + 1, ...n)} only through Hence 

i;"(z)-{X,"(z),X"(z)}-{y/-i)",X(-i)",X"(l,...,z-l),X"(z + l,...,n)} (148) 

is a Markov chain. Therefore (11471) equals: 

N-l 

(n" it) \X: it)) - h {Y; (z) \X: (z) , X" (z)) (149) 

i=l 
JV-1 

= E^(^"(0;>;"(OI^"(0) (150) 



1=1 



<(Ar-l)(n/(X;F,|X,) + n£2) (151) 

where where €2 > and lim„^oo £2 = 0. Equation (11511 ) follows from the fact the channel is 
memoryless and the codebook is composed of i.i.d. sequences. 
Applying (fT42l) . (fT36l) and (fTSTl) to (fT33]) . we have 

if(c^"i|y,"^,X,"^,C) (152) 

>(iV - l)r2(fi + min{C, Ri}) - (N - l)nei - (N - l){nI{X; F,|X,) + n^s) (153) 

>{N- l)n(min{C, Ri}) - (N - l)n{ei + 52) (154) 

=H{c^-' \C) - (iV - l)n{ei + £2) (155) 

Equation (11541) follows from (11401) . (11551) is because Cfc in each block is chosen independently 
of other blocks, Ck is chosen according to a uniform distribution from a set with a cardinality 

From (I152D - (I155I) . we have: 

1 



lim -/(c^-^ F,"^, X;^|C) = (156) 



Combining it with (11261) we have 



lim E[Pr(W^^-i ^ l^^-i|C)] + -/(c^-^ F,"^,X,"^|C) = (157) 
Therefore, there must exists a codebook C* such that 

lim E[Pr(iy^-i 7^ W^^-^IC*)] + -/(c^-^ K"'^,X"^|C*) = (158) 

n— >oo 7T, 
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Since each term on the left side of (11581) are nonnegative, it follows that with this codebook: 

lim E\Ft(W^^^ ^ W^^^IC*)] = (159) 

lim -/(c^-'; K"^, X"^|C*) = (160) 

For the simplicity of notation, we omit C* from the conditioning term. It is understood that all 
the derivations below are conditioned on C*. 

Since c^^^ is a deterministic function of we have 

i7(PF^-i|y,"^,x,"^,c) 
=h{c^-\w''-^\y;'^,xi^^,c) (161) 
>i7(c^-i|y;^^,x;^^,c) (i62) 

>(A^ - l)n(min{C, Ri}) - {N - l)n{ei + £2) (163) 



Hence, 



R,= lim lim ^i7fiy^-i|K"^,X"^) (164) 
> mini lim (165) 



The achievable region can then be discussed for two cases: 

i) Ri < lim„^oo C 

ii) Ri > lim„^oo C 

When i?i < lim^^oo C, the region (_Ri, R^) is given by: 

0<Re<Rl 

(166) 

< i?i < lim C 

n— >oo 

where Um^^oo C meets the following condition from (11251) : 

< lim C <[I (X; F, y;|X,) - /(X; F,|X,)]+ (167) 



subject to the constraint /(X^; F) > Yr\YXr). When Ri > lim„^oo C, the region Re) 
is given by: 

0<Re< lim C 

(168) 

lim C <Ri< lim (7 + J(X; Yr\Xr) 
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Finally, the union of these two regions (11661) and (11681 ) becomes the region given below: 

Re < Rl < I [X;Y,Yr\Xr) 

0<Re<[I {X; Y, Yr\Xr) - I {X; Yr\Xr)] + 
with the constraint /(X^; Y) > Yr\YXr). 



(169) 



Appendix B 
Proof of Theorem [2] 

The converse for i?i is given in [22] using the cut set bound. The converse for i?e can be 
derived by specializing the upper bound in [21], which is stated in (|58l) as I{X;Y\Xr,Yr). For 
our model, this can be upper bounded as: 

/ (X; Y\Yr, X,) = / {Xr, Xn; Y\Yr, X,.) 

= h {Y\Yr, Xr) - h {Y\Yr, X,., Xr, Xd) 

From ([T2l) . Y — Xjj, X.^ — Yr, Xr is a Markov chain. Hence (11711 ) equals: 

h{Y\Yr,Xr)-h{Y\Xr,XD) 

<h{Y\Xr) -h{Y\Xr,XD) 
= I{XD;Y\Xr) 

Hence we have proved the converse for R^.. 

The achievability of (|2T]) also follows from the partial decode-and-forward scheme presented 
in reference [21, Theorem 1]. 

Theorem 8: [21, Theorem 1] The following region is achievable: 

' Ri<I (X; Y\U, Xr) + min{/ (f/; y;.|X,) , / (f/, X,; Y)} 

U ] Re<[I{X-Y\U,Xr)-I{X-Yr\U,Xr)Y 

p{U,X,Xr) 

p{Y^,Y\X,Xr) < Re < Rl 



(170) 
(171) 

(172) 
(173) 
(174) 



(175) 



In (11751) . we let X = {Xo, Xr},U = Xr, and restrict the union to be over the probability 
distributions of the form p{Xr)p{Xci\Xr)p{XR\Xr), and we obtain: 

I {X;Y\U,Xr) - I {X;Yr\U,Xr) (176) 

=/ {XrXd; Y\XRXr) - I {XrXd; r,|X^X,) (177) 

=/ (X^; Y\XRXr) - I {Xn; 1;|XhX,) (178) 
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=H {Y\XRXr) - H {YlXnXRXr) - I (Xn; YrlXnXr) (179) 

= H {Y\Xr) - H {Y\XDXr) - I (Xd; F^IX^X.) (180) 

=1 {Xd; Y\Xr) - I {Xd; 1;|XrX,) (181) 

%(Xz5;r|X,) (182) 

where step (a) follows from Xr — Xr — Y being a Markov chain [22] and Xr — XrXc — Y 
being a Markov chain. Step (b) follows from Xc — XRXr — Yr being a Markov chain [22]. 
Moreover, the bound on Ri can be expressed as: 

/ (X; Y\U, Xr) + mm{I {U; F,|X^) , I ([/, X^; Y)} (183) 

= min| I {UXXr]Y) , I {U-Yr\Xr) + I {X-Y\U,Xr) } (184) 

= min I / (XX,; Y) , I {U; r,|X,) + / (X; Y\U, X,) } (185) 

where step (c) follows from U — XX, — Y being a Markov chain. 

Note that (|185l) is the same as [22, (2)], therefore from the same argument therein, we obtain: 

min I / (XX,; Y) , I {U; F,|X,) + I (X; Y\U, X,) } (186) 

= min I / (Xd, X,; Y) , / (X^; i;|X,) + / (X^; F|X,) } (187) 

By substituting (11871) and (11821) into (11751) . we find that the rate pair in (|2T]) is achievable. 

Remark 17: It is shown in [21, Lemma 3] that the achievable rate region (11751) is convex. 
Therefore the rate region (|2TI) is also convex. □ 

Appendix C 

Proof that (EHl) is a Monotonic Increasing Function of the Source Power 



2 

It suffices to show that the argument of C( ), which is p + -nr^, is a monotonically increasing 
function of p. The expression of ctq is given by (|32|) . Thus we have: 



Since p + is always positive, we can prove its monotonicity in p by showing ln(p + 

Q 

is monotonically increasing in p. From (11881 ). it is given by 



Inp + In ( ( 1 + a^) (h'^Pr + l) {p + I) - a^) - In ((h'^Pr + + 1 ) + 1) - a^) (189) 
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Let A={1 + a^) {b^Pr + 1). Let 5 = {b^Pr + a^ + l). Then the derivative of (fT89l) with respect 
to p is given by 



P (p+l-f) (p+l-f) 

Note that /1,_B > a^. Hence the denominator of (11901 ) is positive. Therefore we only need to 
show that the numerator of (11901) is positive. The numerator of (11901) equals 

(p+lf-a^Q + ^)(. + l) + |^ + a^Q-^). (191) 
= iP+^f-^P + 4^-^"(\ + i) (192) 



B"^ AB \A BJ 



Since A> B > a?, / B < 1, (11941) is positive. Therefore the derivative of ln(p + j^^) with 
respect to p is positive. This means (|38l) is a monotonic increasing function of p. 



Q 



Appendix D 

An Example where introducing a Second Eavesdropper Decreases Secrecy 

Capacity 

Consider a special case of Model 2 defined by 

Yr = X + N Yd = X-N (195) 
Yr = Xr (196) 

This is a Gaussian relay channel with orthogonal components with reversely correlated noise. 

is a zero mean Gaussian random variable with unit variance. Hence its probability density 
function is symmetric around the origin: p{—N) = p{N). 

We first observe that since the orthogonal link between the relay and the destination is 
noiseless, the optimal relaying scheme in this case is choosing X^^i = l^r,j-i- This can be proved 
as follows: First we recognize, for this channel, given Y^'\ the signals X" do not provide more 
information to the eavesdropper. This is because the relay is not interfering itself and hence as 
shown in Remark [TTl the secrecy capacity can be computed from lim„_^oo ^H{W\Y^^) instead. 
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i.e., X" can be dropped from the conditioning term. Therefore for any given relay scheme, we 
can always use Xr^i = l^r,i-i to give the destination the signals received by the relay, and ask the 
destination to to compute the X" generated from the original relaying scheme instead. It can be 
verified that in this way the secrecy constraint is fulfilled and W can still be transmitted reliably. 
Therefore, the secrecy rate achievable by any given relay scheme is achievable via Xr^i = l^r,i-i> 
which must be the optimal relaying scheme. 

Hence the destination essentially receives (Fr,i-i, Yda) at the ith channel use and the eaves- 
dropper receives F^- The channel is therefore equivalent to a 1 x 2 MIMO wiretap channel [4]. 
Note that the destination can remove the noise Ni completely by simply computing y,. j + Yd,^- 
The eavesdropper, on the other hand, observes an AWGN link with finite capacity. Hence the 
secrecy capacity of this channel is easily seen to be oo. 

Now, we construct a second relay channel. The channel is the same as the previous one except 
that the received signal at the relay becomes: 

{Yr,Xr} = {X + N,Xr} (197) 

That is to say that the relay receives an additional copy of its transmitted signal. This should 
not benefit the relay/eavesdropper at all. So the secrecy capacity is still oo . 

Now, we construct a third relay channel from the second relay channel, by adding one more 
eavesdropper to the model. Let the signal received by this second eavesdropper be: 

Y, = {X-N,Xr} (198) 

It follows that p{Ye\X,Xr) = p{X - N,Xr\X,Xr) = p{-N) = p{N) = p(r,|X,X,). Hence, 
the new eavesdropper observes the same marginal distribution as the eavesdropper located at 
the relay node. However, this eavesdropper receives exactly the same signal received by the 
destination. Therefore the secrecy capacity of the new system is reduced to 0. 
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Appendix E 
Proof that (gS]) as a Markov Chain 

Recall that A is shown in ([U) as the local randomness generated at the relay. Using chain rule, 
we have 

p{y-,y:,x-,x:,a) 

n 

= p (X", A) Hp {Xr,\Y:-,\ Xir,\ A, X^^) p (n,, Yr,\Y^\ F;,r\ X^i, X", a) (199) 

i=l 

From ©, we observe that 

Xr, - {Y:;,\ Xir,\ A} - {Y:~i\X-} (200) 

is a Markov chain. Since the channel is memoryless, and the relay function ([T]) has to be causal, 
we observe 

Yr,} - {x,,„ X,} - {y:j\ y;;i\ x^^S A} (20i) 

is also a Markov chain. Applying these two Markov chains to (11991) , we have: 

p (y,", l;^ x^ x;, A) = p (x^ A) f{p(XrAY:j\ xi;^\ a) p (i;,., i;,^|x,,„ x,) (202) 

We next integrate out Y,J.' and X^ from both side of (12021 ). This can be done in a recursive 
fashion as we show next. First we integrate over F^.n on both side of (12021) . This gives us: 

p{Yr\Y:,x\x:;,A) 

n-l 



=p (x^ A) Hp (x,,|y;;r\ x;7\ a) p {y,„ f,,|x,„ x,) 

i=l 

P {Xr,nKr\ ^.TS a) P iye,n\Xr,n, X„) (203) 

From (|43l) . we have p{Ye-ri\Xr,n, Xn) = p(Y'e,r).|X„). Applying it to (12031) yields: 



p(Yr\Y:,x\x:,A) 

n-l 

--P (x^ A) Hp (x,,|y;;r\ x;7\ a) p (i;,, n,!^.,, 

i=l 

P {Xr,nKr\ ^.TS ^) P {ye,n\Xn) (204) 
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We next integrate over Xr^n on both side of (12041) . which yields: 

n-l 

=P (X", A) p (ye,n|X„) n P (Xr,i\y:^\ Kf. A p (Fe,^, ^.^l^,,,, X^) (205) 



i=l 

Repeating the process in (I203I) - (I205I) for n — 1, n — 2, 1, we have 



p{Y:,X-,A) =p{X'',A)Y[p{Y,,\X,) (206) 

i=l 

Integrating over A on both sides of (|206l) . we have 

n 

p(F,",X«) =p(X«)np(>;,l^.) (207) 

i=l 

Integrating over l^j+i both sides of (12071 ). we have 

p{yI„X^-) =p{X^)f[p(Y,,\X,) (208) 
Integrating over Fej on both sides of (12081) . we have 

p {Yi^\ X") = p (X") n P (>^e,|X,) (209) 

i=l 

Dividing each side of (12081 ) by the corresponding side of (12091 ). we have 

p(y,,\YI^\X^) =p(Y,,\X,) (210) 



Hence we have shown that Yej — Xj — Xy^Y^^ is a Markov chain. 

We next prove that F^j — — -^^bl^'^iT^ is a Markov chain. Again, we start with (12021 ) and 
integrate out YJ^ and X" from both side of it in a recursive fashion. First we integrate over Fe,n 
on both side of (12021) and obtain 

p{y;^,y:^^\x^,x^,a) 

n-l 

=p (X", A) Hp (x,,|y;;7\ x^^S a) p i;,|x,„ x,) 
1=1 

P {Xr,nKr\ X^-^\ a) P (n,„|X,,,„, X„) (211) 

Then from (|43] ) we observe that p (Fr,n|-^r,n5 -^n) = P iXr,n\Xn)- Hence (12111) becomes: 

p(y;^F;:^^x^x,^A) 
=p (X", A) n p (^r,l>7;TS a) p (r,,, F,,|x,„ X,) 
p (x,,„|i7:r\ x.^rS a) p (i;,„|x„) (212) 
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We next integrate over Xr^n on both side of (|212|) . which yields: 

p{Y;,Y:^^\x-,xr\A) 

=p (X", A) p {Yr,n\Xn) H P (^r.^ll^.l', Kj', ^) P (Ye,, Y^^Xr^^, Xi) (213) 

i=l 

Repeating the process in (|21 1I) - (I213I) for — 1, n — 2, 1, we have 

n 

p (i;", X", A)=p (X", A) n P {YrAX,) (214) 
Integrating over A on both sides of (|214l) . we have 

=pmnp(i;,i|Xi) (215) 
Integrating over 1^," on both sides of (12151 ). we have 

=p(x")np(y,,,|x,) (216) 

i=l 

Integrating over Yj.^j on both sides of (12161 ). we have 

i-i 

p{Y^;[\X^)=p{X^)X{p{Yr,\Xi) {111) 

i=l 

Dividing each side of (|216l) by the corresponding side of (|217l) . we have 

p{Y,,j\Y^.^\X^) =p(n,,|X,) (218) 
Hence we have shown that F^j — Xj — X]j[F/]~^ is a Markov chain. 

Appendix F 

Proof that H{W\Y^XJ^) = H{W\Y^) for the Model stated in (|4T]) 
We begin with 

= X,", A) = HiWlY;", A) (219) 

where the last equality follows from the fact that X" is a deterministic function of {Yj"',A}. 
Hence we only need to prove H{W\Y^, A) = H{W\Y^) for the channel model defined in (HTI) . 
This can be done as follows: 
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First we factorize p {Y^, X", X", A, W) using a similar procedure seen in (|199l )- (|202|) : 

piY;',X'',Xl^,A,W) (220) 
=p (W, A, X") p {Y;, X,"|X", a, W) (221) 

=p {w, A, X") n p (x,,|F;;r\ x;,i\ a) p (n,|x,,,, x,) (222) 



i=l 



where in (12221) we use the Markov chain stated in (12001 ) and ( I201I ). 
From (|4TI) . we have p (y^.^l-^r,*, -^i) = P {Yr,i\Xi). Hence we have 

n (223) 
=p (ly, A, X^) l[p{Xr,\Y;-,\ Xlj\ a) p (F,,,|X,) 

i=l 

We next integrate out X" from both sides of (|223l) using the procedure shown in Appendix |El 
which yields: 

n 

p{Y:^,X^,A,W)=p{W,X^,A)X{p{YrAX,) (224) 

We next use the fact that y^.i — Xi — X]j[F/J^ is a Markov chain, as stated in (l45l) and proved 
in Appendix |El from which we have 

n 

p (i;", X", A,w)=p {w, x^ A) n p K;;r^) (225) 

i=l 

Since 14^ is a deterministic function of X", we have 



p x^ A, ly) = p {w, X", A) n p >?;rS w) me) 

i=l 

= p(iy,X",A)p(F;^|X",iy) (227) 
Since A is independent from W, X", (|226l) - (|227l) can be written as 

p (n", X", A,W)=p {W, X", F,") p {A) (228) 

From it, we can write: 

p(l;^A)=p(F,")p(A) (229) 

and 

p (y,", A, W^) = p (ly, n")p (A) (230) 
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From (12291) and (12301) . we have 

p{W\Y^,A)=p{W\Y:^) (231) 

Hence H{W\Y^^,A) = 

Appendix G 
Proof that (|79l) is an Upper Bound 



We begin with 

H (W) =H {W\Y^) + / {W; Y^) (232) 

Due to the secrecy constraint, we have lim„^oo ^H^'i ^e) = 0- Due to the fact that W can be 
decoded from Yg,Y^ reliably, we have, from Fano's inequality, lim^^oo {W\YJ^Y]l) = 0. 
Hence there exists e > such that 

/ {W; 17) < ne/2 (233) 

H {W\YSY^) <ne/2 (234) 

lim e = (235) 

n— >oo 

For this e, we find (12321) is upper bounded by 

H {WIY^"") +ne/2 (236) 

<H (ly |F") - H {W\YgY^) + ne (237) 

<H iWlY^"") - H {WIY^Y^X;^) + ne (238) 

</ (VT; y^, r^, X,"|F,") + (239) 

</ (VT, X*^; 17, r^, X;|i;") + ne (240) 

=/ (X"; F^, F^, X^ iy;^) + ne (241) 

=/ (X"; 17, X;^|i;") + / (X"; Yg, X:;) + ne (242) 

n 

=1 (X"; 17, X,"|i;") + ^ / (X"; 1^, X«, Y},;,') + ne (243) 



i=l 



From ([55]), we observe Yj^j - {Y^"", Yg, X^, Y^;/} - X" is a Markov chain. Hence (12431) equals: 



I (X"; Yg, X,"|F,") + ne (244) 

<I (X"; Yg, i;", X;|F,") + (245) 

=/ (X"; Yg, i;"|F,") + / (X"; X;|F^, i;", Y^) + ne (246) 

=1 (X^ r^, F,"|F,") + ^ / (X"; X,,i|X;-i\ r^, F,", y,") + ne (247) 
The term inside the sum in (12471) can be bounded as 

/ (X"; X,,,,|X-\ Yg, i;", F,") (248) 

</ (X"; A, X^i i;", 17) (249) 

</ (X"; F,", F,") + / (X"; X'.jYg, Y^, Y^ , A) (250) 

=J(X";A|y^,i;",17) (251) 



where A is the local randomness at the relay. (12511) is due to the fact that X^^ is a deterministic 
function of {Y^ , A}. 
From (l55l) . we have 

n 

p (X", X!;, A, Yg, y,") = p (X") p (A) n P (>^D„ Yr„ Y,,\Xi) p (x,,i|x-i, a) 

1=1 

(252) 

from which we have 

n 

p (X-, Yg, F,", F,", A) = p (X") p{A)l[p {Yd,, Y,„ Fe,|X,) (253) 

1=1 

Hence {F^, l^, FJ', X"} are all independent from A. Therefore (I25T]) equals 0. (12471) thus 
becomes: 

J(X";F^,F,"|F,")+n£ (254) 

n n 

= Y,h {YD,Yr,,\Yg-lY:-{^Y:) -Y^h {YD.YrAYg-^Y^J^Y^X'') + ne (255) 

i=l i=l 

n n 

<J2h {Yd,Y,,\Y,,) -J2h {YD,^r,\Yg;lY;:^'Y:X^) + ne (256) 

i=l i=l 

From (1551) and the fact that the channel is memoryless and the relay function is causal, we 
observe that 

{Yn„Yr,} - {F,„X,} - {F2,-;i\F;;r\F,,],[,X],[} (257) 
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is a Markov chain. Hence (12561) equals: 

n n 

J2 h {Yn,Yr,,\Y,,) - ^ /i {YD,Yr,i\ye,X.) + ne (258) 

i=l i=l 

n 

= J2l (X,; YD,iYr,\Y,,) + ne (259) 

i=l 

Define Q as a random variable that is uniformly distributed over {l,2,...,n}. Define X = 
Xq, Yd = Yd,q, Yr = Yr,Q, Ye = Ye,Q. Then ([2591) equals: 

nl {X; YoYrlYeQ) + ne (260) 

=n {h {YoYrlY^Q) - h (YoYrlYeXQ)) + ne (261) 

<n {h {YoYrlYe) - h {YDYr\YeXQ)) + ne (262) 
Since {Yd, Yr} — {Ye, X} — Q is a Markov chain, (12621) equals: 

n {h (YDYrlYe) - h {YDYr\YeX)) + ne (263) 
=nl {X; YoYrlYe) + ne (264) 

Dividing both sides by n and letting n — s> oo, we have the upper bound in (1791) . 
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